Using the Crowd to Annotate Metadiscursive Acts
Authors
Abstract
This paper addresses issues relating to the definition and non-expert understanding of metadiscursive acts. We present existing theory on spoken metadiscourse, focusing on one taxonomy that defines metadiscursive concepts functionally rather than formally. A crowdsourcing annotation task is set up with two main goals: (a) to build a corpus of metadiscourse, and (b) to assess how well non-experts understand metadiscursive concepts. This initial annotation effort focuses on five categories of metadiscourse: INTRODUCING TOPIC, CONCLUDING TOPIC, MARKING ASIDES, EXEMPLIFYING, and EMPHASIZING. The crowdsourcing task is described in detail, including instructions and quality assurance mechanisms. We report results in terms of time-on-task, self-reported confidence, requests for additional context, number of occurrences, and inter-annotator agreement. Results show that the crowd is capable of annotating metadiscourse and give insight into the complexity of the different concepts in the taxonomy.
Similar resources
metaTED: a Corpus of Metadiscourse for Spoken Language
This paper describes metaTED, a freely available corpus of metadiscursive acts in spoken language collected via crowdsourcing. Metadiscursive acts were annotated on a set of 180 randomly chosen TED talks in English, spanning different speakers and topics. The taxonomy used for annotation is composed of 16 categories, adapted from Ädel (2010). This adaptation takes into account both the ma...
A Data-driven Method for Crowd Simulation using a Holonification Model
In this paper, we present a data-driven method for crowd simulation with a holonification model. This extra module increases the accuracy of the simulation and generates more realistic agent behavior. First, we show how to use the concept of a holon in crowd simulation and how effective it is. To this end, we use simple rules for holonification. Using real-world data, we model the...
Personalized Human Computation
Significant effort in machine learning and information retrieval has been devoted to identifying personalized content such as recommendations and search results. Personalized human computation has the potential to go beyond existing techniques like collaborative filtering to provide personalized results on demand, over personal data, and for complex tasks. This work-in-progress compares two app...
Digital Art and Crowd Creation in Iran (Case Study: Tehran Annual Digital Art Exhibition)
This paper aims to show the status of digital art in Iran and explain how the meaning of being an artist has been transformed in the digital age. The primary assumption of this paper is that the experience of digital art has revived the collective experience of creating art. Although interactivity is considered to be the most important quality of digital art, their collective, collaborative and pr...
Communicating vagueness by hands and face
This paper investigates the bodily signals of vagueness. After presenting the conceptual notion of vagueness by contrasting it with uncertainty, approximation, confusion and ambiguity, the notion of a “metadiscursive vagueness signal” is presented: a metadiscursive signal that conveys “I am being vague”. Then a qualitative analysis of bodily signals of vagueness is presented, while singling ...